Towards Cognitive Automation of Data Science

نویسندگان

  • Alain Biem
  • Maria Butrico
  • Mark Feblowitz
  • Tim Klinger
  • Yuri Malitsky
  • Kenney Ng
  • Adam Perer
  • Chandra Reddy
  • Anton Riabov
  • Horst Samulowitz
  • Daby M. Sow
  • Gerald Tesauro
  • Deepak S. Turaga
چکیده

A Data Scientist typically performs a number of tedious and time-consuming steps to derive insight from a raw data set. The process usually starts with data ingestion, cleaning, and transformation (e.g. outlier removal, missing value imputation), then proceeds to model building, and finally a presentation of predictions that align with the end-users objectives and preferences. It is a long, complex, and sometimes artful process requiring substantial time and effort, especially because of the combinatorial explosion in choices of algorithms (and platforms), their parameters, and their compositions. Tools that can help automate steps in this process have the potential to accelerate the time-to-delivery of useful results, expand the reach of data science to non-experts, and offer a more systematic exploration of the available options. This work presents a step towards this goal.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cognitive Automation of Data Science

This paper explores how an automated procedure may leverage domain knowledge and reasoning to further automate Machine Learning (ML) and Data Science in a manner that may be thought of as cognitive. To this end, we first describe key features that we believe a cognitive automation system for data science must possess. The goal of a system embodying this concept would be to extend existing data-...

متن کامل

Towards an Operational Definition of Critical Thinking

This paper offers a state-of-the-art working definition for the concept of Critical Thinking (CT hereafter) in an attempt to provide a framework for the development of an operational definition for this complex concept. Having studied various definitions and models, proposed for CT by major figures in the field, the key defining features of this rich concept were identified and classified. Base...

متن کامل

Finding Trends in Human-Automation Interaction Research in Order to Formulate a Cognitive Automation Strategy for Final Assembly

This article presents a literature review within the area of Human-Automation-Interaction in order to find trends and central factors in recent HAI research. These factors will then be used in order to suggest a cognitive automation strategy for final assembly. Trends within final assembly is towards take individual aspects into account, choose an appropriate level of automation and investigate...

متن کامل

Cognitive Support for Human-Guided Mapping Systems

The semantic web envisions the Internet as a globally linked database, one that supports data interoperability and machine readable semantics. The “back-bone” of the semantic web is structural representations of domains of knowledge in the form of ontologies. A critical prerequisite to supporting this global information exchange is that mappings must exist between domain related ontologies. The...

متن کامل

Performance Knowledge Discovery for Modeling

Performance modeling has long been considered a difficult science requiring expertise from seasoned capacity planners. Currently there has been a shift in the area of performance modeling towards ease-of-use and automation of the entire process. Automation not only hides this difficult science from the users but also delivers immediate Return on Investment for administrators with deploy and run...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015